The Best 780 Sequence Labeling Tools in 2025

Indonesian Roberta Base Posp Tagger
MIT
This is a POS tagging model fine-tuned based on the Indonesian RoBERTa model, trained on the indonlu dataset for Indonesian text POS tagging tasks.
Sequence Labeling Transformers Other
I
w11wo
2.2M
7
Bert Base NER
MIT
BERT fine-tuned named entity recognition model capable of identifying four entity types: Location (LOC), Organization (ORG), Person (PER), and Miscellaneous (MISC)
Sequence Labeling English
B
dslim
1.8M
592
Deid Roberta I2b2
MIT
This model is a sequence labeling model fine-tuned on RoBERTa, designed to identify and remove Protected Health Information (PHI/PII) from medical records.
Sequence Labeling Transformers Supports Multiple Languages
D
obi
1.1M
33
Ner English Fast
Flair's built-in fast English 4-class named entity recognition model, based on Flair embeddings and LSTM-CRF architecture, achieving an F1 score of 92.92 on the CoNLL-03 dataset.
Sequence Labeling English
N
flair
978.01k
24
French Camembert Postag Model
French POS tagging model based on Camembert-base, trained using the free-french-treebank dataset
Sequence Labeling Transformers French
F
gilf
950.03k
9
Xlm Roberta Large Ner Spanish
A Spanish named entity recognition model fine-tuned based on the XLM-Roberta-large architecture, with excellent performance on the CoNLL-2002 dataset.
Sequence Labeling Transformers Spanish
X
MMG
767.35k
29
Nusabert Ner V1.3
MIT
Named entity recognition model fine-tuned on Indonesian NER tasks based on NusaBert-v1.3
Sequence Labeling Transformers Other
N
cahya
759.09k
3
Ner English Large
Flair framework's built-in large English NER model for 4 entity types, utilizing document-level XLM-R embeddings and FLERT technique, achieving an F1 score of 94.36 on the CoNLL-03 dataset.
Sequence Labeling English
N
flair
749.04k
44
Punctuate All
MIT
A multilingual punctuation prediction model fine-tuned based on xlm-roberta-base, supporting automatic punctuation completion for 12 European languages
Sequence Labeling Transformers
P
kredor
728.70k
20
Xlm Roberta Ner Japanese
MIT
Japanese named entity recognition model fine-tuned based on xlm-roberta-base
Sequence Labeling Transformers Supports Multiple Languages
X
tsmatz
630.71k
25
Gliner Medium News V2.1
Apache-2.0
A fine-tuned version based on GLiNER, optimized for news entity extraction, achieving up to 7.5% higher zero-shot accuracy across 18 benchmark tests
Sequence Labeling English
G
EmergentMethods
532.81k
75
Fullstop Punctuation Multilang Large
MIT
A multilingual model for predicting punctuation in English, Italian, French, and German texts, designed to restore the punctuation structure of transcribed speech.
Sequence Labeling Transformers Supports Multiple Languages
F
oliverguhr
375.32k
163
Bert Base Multilingual Cased Ner Hrl
A multilingual named entity recognition model based on mBERT, supporting 10 high-resource languages, capable of identifying three types of entities: locations, organizations, and person names.
Sequence Labeling Transformers
B
Davlan
363.27k
72
Bert Large NER
MIT
A BERT-large fine-tuned named entity recognition model achieving state-of-the-art performance on the CoNLL-2003 dataset
Sequence Labeling English
B
dslim
360.98k
150
Ner French
Flair's standard 4-class French NER model, based on Flair word embeddings and LSTM-CRF architecture, achieves an F1 score of 90.61 on the WikiNER dataset.
Sequence Labeling French
N
flair
335.11k
13
Ner German Large
A built-in German 4-class large named entity recognition model in the Flair framework, based on XLM-R embeddings and FLERT technique, achieving an F1 score of 92.31 on the CoNLL-03 German dataset.
Sequence Labeling German
N
flair
297.28k
40
Xlm Roberta Base Romanian Ner Ronec
A named entity recognition model trained on the Romanian NER dataset RONEC based on the xlm-roberta model, achieving an f1-Macro score of 95 on the test set.
Sequence Labeling Transformers Other
X
EvanD
283.26k
3
Distilbert Base Multilingual Cased Ner Hrl
A named entity recognition model for 10 high-resource languages, based on a fine-tuned Distil BERT base model, capable of identifying three types of entities: locations, organizations, and persons.
Sequence Labeling Transformers
D
Davlan
270.56k
78
Wikineural Multilingual Ner
A multilingual named entity recognition model combining neural networks and knowledge bases, supporting 9 languages
Sequence Labeling Transformers Supports Multiple Languages
W
Babelscape
258.08k
142
Roberta Large Ner English
MIT
An English named entity recognition model fine-tuned on RoBERTa-large, trained on the conll2003 dataset, specifically optimized for entity recognition in email/chat data.
Sequence Labeling Transformers English
R
Jean-Baptiste
236.85k
71
Camembert Ner
MIT
A Named Entity Recognition (NER) model fine-tuned on the wikiner-fr dataset based on camemBERT, excelling in handling named entity recognition tasks in French texts.
Sequence Labeling Transformers French
C
Jean-Baptiste
230.81k
110
Camembert Ner With Dates
MIT
A French named entity recognition model fine-tuned based on camemBERT, with added date label functionality
Sequence Labeling Transformers French
C
Jean-Baptiste
219.11k
43
Ner English Ontonotes Large
Flair's built-in large English named entity recognition model for 18 classes, trained on the Ontonotes dataset using XLM-R embeddings and FLERT technology.
Sequence Labeling English
N
flair
176.21k
96
Ner English Ontonotes
Flair's built-in English 18-class named entity recognition model, trained on the Ontonotes dataset with an F1 score of 89.27.
Sequence Labeling English
N
flair
175.71k
19
Roberta Large Tweetner7 All
A named entity recognition model fine-tuned on the tner/tweetner7 dataset based on roberta-large, specifically designed for entity recognition in Twitter text
Sequence Labeling Transformers
R
tner
170.06k
1
Sat 3l Sm
MIT
State-of-the-art sentence segmentation technology using a 3-layer Transformer architecture, supporting multilingual text segmentation.
Sequence Labeling Transformers Supports Multiple Languages
S
segment-any-text
168.01k
6
Albert Tiny Chinese Ws
Gpl-3.0
Provides Traditional Chinese transformers models and natural language processing tools
Sequence Labeling Transformers Chinese
A
ckiplab
166.28k
6
Ner Dutch Large
Flair's built-in Dutch 4-category named entity recognition large model, based on XLM-R embeddings and FLERT technology, achieves an F1 score of 95.25 on the CoNLL-03 Dutch dataset.
Sequence Labeling Other
N
flair
147.32k
9
Layoutreader
A reading order prediction model that converts text boxes extracted from PDF or detected by OCR into a readable sequence.
Sequence Labeling Transformers
L
hantian
139.61k
27
Deid Bert I2b2
MIT
This model is used to identify and remove protected health information (PHI/PII) from medical records in compliance with HIPAA privacy standards.
Sequence Labeling Transformers Supports Multiple Languages
D
obi
129.39k
21
Bert Base NER Russian
MIT
A Russian text named entity recognition (NER) model fine-tuned based on bert-base-multilingual-cased, using BIOLU annotation format, capable of recognizing various entity types such as person names, locations, and organizations.
Sequence Labeling Transformers Other
B
Gherman
128.72k
7
Ner English
Flair's built-in standard English 4-category named entity recognition model, based on Flair embeddings and LSTM-CRF architecture, achieving an F1 score of 93.06 on the CoNLL-03 dataset.
Sequence Labeling English
N
flair
127.67k
34
Piiranha V1 Detect Personal Information
Piiranha-v1 is a fine-tuned model based on microsoft/mdeberta-v3-base, specifically designed to detect 17 types of personally identifiable information (PII) in six languages.
Sequence Labeling Transformers Supports Multiple Languages
P
iiiorg
125.41k
181
Bert Base Turkish Cased Ner
MIT
A Turkish named entity recognition model fine-tuned based on the dbmdz/bert-base-turkish-cased model, capable of recognizing entities such as person names, organization names, and location names.
Sequence Labeling Transformers Other
B
akdeniz27
115.25k
23
Biomedical Ner All
Apache-2.0
An English named entity recognition model trained on distilbert-base-uncased, specifically designed for identifying biomedical entities (107 entity categories), suitable for text corpora such as case reports.
Sequence Labeling Transformers English
B
d4data
112.41k
165
Bpmn Information Extraction V2
Apache-2.0
A BPMN process information extraction model fine-tuned based on bert-base-cased, used to extract key elements such as executors and tasks from textual process descriptions
Sequence Labeling Transformers
B
jtlicardo
112.15k
14
Zh Wiki Punctuation Restore
A tool for restoring punctuation marks in Chinese Wikipedia texts, supporting the restoration of 6 common punctuation marks.
Sequence Labeling Transformers Supports Multiple Languages
Z
p208p2002
102.99k
11
Xlm Roberta Large Finetuned Conll03 English
Named entity recognition model fine-tuned on the English CoNLL2003 dataset based on XLM-RoBERTa-large
Sequence Labeling Supports Multiple Languages
X
FacebookAI
84.75k
169
Bert English Uncased Finetuned Pos
A model for Chinese part-of-speech tagging, supporting 17 common POS tags.
Sequence Labeling
B
vblagoje
79.89k
40
Bert Spanish Cased Finetuned Ner
A fine-tuned version based on the Spanish BERT cased model (BETO) on the NER-C dataset, specifically designed for Named Entity Recognition (NER) tasks.
Sequence Labeling Spanish
B
mrm8488
77.49k
21
Bert Fa Base Uncased Ner Peyma
Apache-2.0
A Transformer-based Persian language understanding model, reconstructed vocabulary and fine-tuned on new corpora, expanding multi-domain application capabilities
Sequence Labeling Other
B
HooshvareLab
69.74k
7
Gliner Multi Pii V1
Apache-2.0
GLiNER is a Named Entity Recognition (NER) model capable of identifying various types of Personally Identifiable Information (PII).
Sequence Labeling Supports Multiple Languages
G
urchade
67.78k
107
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase